(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 
International Bureau 

(43) International Publication Date 
20 September 2001 (20-09.2001) 




Hill 



PCT 



III 



(10) International Publication Number 

WO 01/69899 A2 



H04M 3/00 



(51) International Patent Classification 7 : 

(21) International Application Number: PCT/U SO 1/07686 

(22) International Filing Date: 12 March 2001 (12.03.2001) 

English 
English 



(25) 



Filing Language: 
(26) Publication Language: 



(30) Priority Data: 

09/524342 



13 March 2000 (13.03.2000) US 



(63) Related by continuation (CON) or continuation-in-part 
(CIP) to earlier application: 

ye 09/524342 (CON) 

Filedon 13 March 2000 (13.03.2000) 

(71) Applicant (for all designated Stales except US): NOR- 
I TEL NETWORKS LTD. [CA/CA]: World Trade Center 
j of Montreal, 380 St. Antoine Street West, 8th Boor, Mon- 
j treal, Queeec H2Y 3Y4 (CA). 



(72) Inventors; and ' 
(75) Inventors/Applicants (for US only): SOLLEE, Patrick, 
N. [US/US]: 2708 Garden Springs, Richardson. TX 75082 
(US) CREECH, David, R. [US/US]; 2526 Fallview Lane. 
Carrollton, TX 75007 (US). OSTERHOUT, Gregory, 
T [US/US]; 313 Falcon Court. Coppell, TX 75019 (US). 
JESSEN, Christopher, L. [US/US]; 107 LedgenesL, 
McKinney, TX 75070 (US). 

(74) Agent: HU, Dan, C; Trop, Pruner & Hu, P.C., 8554 Katy 
Freeway, Suite 100, Houston, TX 77024 (US). 

(81) Designated States (national): AE, AG, AU AM, AT. AU. 
AZ;BA.BB,BG,BR,BY,BZ,CA,CH,CN,CR,CU.CZ, 

DE, DK. DM, DZ, EE, ES, FI, GB, GD, GE, GH, CM, HR, 
HU, ID, 1L, IN, IS, JP, KE, KG, KP, KR, KZ, LC, LK, LR, 
LS LT, LU, LV, MA, MD, MG, MK, MN, MW, MX, MZ, 
NO NZ,PL,PT,RO.RU,SD,SE,SG,SI,SK,SL,TJ,TM, 
TR, IT. TZ, UA, UG, US, UZ, VN. YU, ZA, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE LS, MW, MZ, SD, SL, SZ, TZ, UG, ZW), Eurasian 
patent (AM, AZ, BY, KG, KZ, MD. RU,TJ, TM), European 

[Continued on next page] 



= (54) Title: CONTROLLING VOICE COMMUNICATIONS OVER A DATA NETWORK 



< 

ON 
ON 
OO 
Os 

o 
O 




(57) Abstract: A method and apparatus of communicating over ^ -work 

control system (32. 36) for call control and to display mformauon relatmg £ ■ « J < ^J^lSc 00 to establish a call session 
and the remote device. 



DOCIO: <WO 01G93SSA2.I ._> 



WO 01/69899 . A2 1 EIIU 11DIEI! II HUH 1111 1 II til 1IIH II11I illll mi mini mi nil mi 



patent (AT. BE; CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, 
IT, LU, MC, NL, PT, SE, TR). OAPI patent (BF, BJ, CF, 
CG, CI, CM ? G A, GN, GW, ML, MR, NE, SN, TD, TG). 



Declaration under Rule 4.17: 

— of inventorship (Rule 4. 1 7(iv)) for US only 



Published: 

— - withoitf international search report and to be republished 
upon receipt of that report 

For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations'* appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



IDQCID: <WO 01G989?A2J_> 



15 



20 



25 



30 



PCT/USOl/07686 

WO 01/69899 

- 1 - 

Controlling Voice r^iminrtions Over A Data Network 

background 

The invention relates to controlling voice communications over a data network. 

Data networks are widely used to link various types of network elements, such as 
persona, computers, servers, gateways, nerwork telephones, and so form. Data networks may 
tach.de private networks (such a local area networks or wide area networks) and pubhc 
networks (such as the Internet). Popular forms of communications between network 
Cements across such data networks include electronic mail, file transfer, web browsing, and 

other exchanges of digital data. 

• With the increased capacity and reliability of data networks, voice commumcanons 
(including telephone calls, video conferencing, and so forth) over data networks have become 
possible Voice communications over data networks are unlike voice communicanons m a 
conventional public switched telephone nerwork (PSTN), which provides users wuh 
dedicated, end-to-end circuit connections for me duration of each call. Communicanons over 
data networks, such as IP (Interne, Protocol) networks, are performed using packets or , 
datagrams that are sen, in burs* from a source ,o one or more destination nodes. Vo.ee data 
sen, over a data network typically shares nerwork bandwidth with conventual no„-vo,ce 
data (e g da,a associated with electronic mail, file transfer, web access, and outer traffic). 

' Various standards have been proposed for voice and multimedia communications over 
data networks. One such standard is me H.323 Recommendation from the Interna,,onal 
Telecommunications Union (ITU), which describes terminals, equipment and serv.ces for 
multimedia communications over data networks. 

Another standard for voice and multimedia communications is the Session tamafon 
Protocol (SIP), which estabhshes, maintains, and ,ermina,=s multimedia sessions over a data 
nerwork. SIP is part ofa multimedia data and con.ro. architecture developed by , he Interne, 
Engineering Task Force (IETF). The IETF multimedia da,a and control architecture also 
includes me Resource Reservation Protocol (RSVP) for reserving nerwork resources; tire 
Real-Time Transport Protocol (RTP) for transporting real-time da* and providing qualuy of 
service (QoS) feedback; the Real-Time S<reaming Protocol (RTSP) for continuing dehvery 
of steaming media; me Session Announcement Protocol (SAP) for advertising muhtmed.a 
sessions by multicast, and the Session Description Proroco. (SDP) for describing mulnmedta 



sessions. 
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To perform voice communications over a data network, a typical computer system 
(such as a desktop computer system or a portable computer system) may be equipped with 
voice processing capabilities. Such capabilities include a microphone, ear phones or 
speakers, and speech processing software. Typically, the speech processing software 
5 includes coder/decoders (CODECs) to encode and decode voice data. The voice processing 
software, including the CODECs, may be run on a microprocessor of a typical computer 
system. However, due to the intensive data processing typically required to process voice 
data, speech performance may not be optimum. For example, there may be delays associated 
with the transfer of such voice data due to the amount of time needed to process the voice 

10 data. Also, if certain types of CODECs that have less resource requirements are selected, 
voice quality may suffer. 

Also, the computer system needs to be fitted with speakers, microphones, and sound 
cards to enable speech processing. Further, such speakers, microphones, and sound cards 
may not provide the desired level of quality, or if they do, may be relatively expensive. 

15 Additionally, to add such speech processing components to a computer system may require 
some configuration to be performed by a user, a process that an unsophisticated user may 
have difficulty with. 

Unless a computer system with powerful processing capabilities are provided, the 
voice quality provided by such computer systems are not at the level typically experienced 

20 (and expected) by users of standard telephones. Such "standard" telephones may include 

analog telephones coupled to a local or central switching office or digital telephones coupled 
to a private branch exchange (PBX) system. More recently, network telephones have been 
developed that are capable of being connected directly to a data network, such as an IP 
network. These network telephones are capable of placing telephony calls over a data 

25 network. The voice quality offered by such telephones are typically superior to those that can 
be offered by computer systems, since such network telephones typically include dedicated 
digital signal processors (DSPs) that perform the data intensive calculations involved in 
speech processing. However, the existing network telephones do not provide desired 
multimedia presentation capabilities such as those offered by displays of computer systems. 

30 Thus, while network telephones offer superior speech capabilities, it does have the desired 
multimedia capabilities. On the other hand, computer systems have superior multimedia 
capabilities, but they suffer from relatively poor speech processing performance. 
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A need thus exists for an improved method and apparatus for controlling voice 
communications over data networks. 

" general, according to one embodiment, a method of communicating over a data _ 

- • • , ,™trol svstem one or more control messages over the 

conno, system such as a compute infonnation relating to the call 

user to perform call control and ,0 vew *an. a ^ ^ 

session. Thus, voice quality assoctated wtth call sesstons 

,. -..^hancsdusinK embodiments of the invention. 

switched data networks ts enhanced using following description. 

Other features and advantages will become apparen, from the following 

from the drawings, and from the claims, 
iwf TW.riniio- rifTh^ Drawings 

^T^o^nTfan embodiment of a communications system 

Z. 2 illustrates component in a network telephone and a ca„ control system tn 
accordance with an embodiment. 
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Fig. 3 illustrates control and data paths between network elements used during a call 
session in accordance with one embodiment. 

Figs. 4 and 5 illustrate example screens displayed by the call control system of Fig. 2 
in accordance with an embodiment. 
5 Fig. 6 is a message flow diagram of messages exchanged between network elements 

in the communications system of Fig. 1 for processing an incoming call. 

Fig. 7 is a message flow diagram of messages exchanged between network elements 
in the communications system of Fig. 1 for placing an outgoing call. 

10 Detailed Description 

In the following description, numerous details are set forth to provide an . 
understanding of the present invention. However, it will be understood by those skilled in the 
art that the present invention may be practiced without these details and that numerous 
variations or modifications from the described embodiments may be possible. For example, 

15 although reference is made to Session Initiation Protocol (SIP) communications sessions in 
accordance with some embodiments, other protocols may be performed in further 
embodiments. 

Referring to Fig. 1, a communications system 10 includes a first data network 12 and 
a second data network 14 that are coupled through a data network cloud 16. The data 

20 network cloud 16 may inblude various links, communications paths, and routers for routing 

messages between data networks 12 and 14. The data network cloud 16 may include a public 
network such as the Internet. The data networks 12 and 14 may be private networks such as 
local area networks (LANs) or wide area networks (WANs). In the ensuing discussion, one 
or some combination of the data networks 12 and 14 and data network cloud 16 may be 

25 referred to collectively as the data network 1 1. As used here, a "data network" or "network" 
may refer to one or more communications networks, channels, links, or paths and systems 
(such as routers) used to route data over such networks, channels, links, or paths. 

The data network 1 1 may include an Internet Protocol (IP) network, which is a 
packet-switched network. One version of IP is described in Request for Comments (RFC) 

30 791, entitled "Internet Protocol," dated September 1981. Other versions of IP, such as IPv6, 
or other connectionless, packet-switched standards may also be utilized in further 
embodiments. A version of IPv6 is described in RFC 2460, entitled "Internet Protocol, 
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Version 6 (IPv6) Specification," dated December 1 998. Packet-switched data networks such 
as IP networks communicate with packets, datagrams or other units of data over the data 
networks. Unlike circuit-switched networks, which provide a dedicated end-to-end 
connection or physical path for the duration of a call session, a packet-switched network is 
5 one in which the same path may be shared by several network elements. Packet-switched 
networks such as IP networks are based on a connectionless internetwork layer. Packets or 
other units of data injected into a packet-switched data network may travel independently 
over any path (and possibly over different paths) to a destination point. The packets may 
even arrive out of order. Routing of the packets is based on one or more addresses carried in 

10 each packet. . 

The packet-based network '12 may also be connection-oriented, such as an ATM 
(Asynchronous Transfer Mode) network or a Frame Relay network. In a connection- 
oriented, packet-based network, a virtual circuit or connection is established between two end 
points. In such connection-oriented networks, packets are received in the same order in 
15 which they were transmitted. 

Network elements connected to the data network 1 1 may also be coupled through a 
data network-PSTN gateway 20 to a public-switched telephone network (PSTN) 22. The link 
between the gateway 20 and the PSTN 22 may be a primary rate interface (PRI) link 
according to ISDN (Integrated Services Digital Network). Standard non-data network 
20 telephones 24 may be coupled to the PSTN 22. Call sessions can thus be established between, 
a data network element and one of telephones 84. 

In the example embodiment as illustrated in Fig. 1, audio (e.g., voice) and multimedia 
(e.g., audio and video) communications may occur over the data network 1 1 between or 
among various network elements, including network telephones 30 and 34 and call control 
25 systems 32 and 36. Other devices capable of voice or multimedia sessions include SIP 

(Session Initiation Protocol) client systems 38 and 40. The SIP client systems 38 and 40 are 
capable of communicating using SEP messaging to establish call sessions. As used here, a 
"call session" refers generally to either a voice or a multimedia session established between 
two or more elements coupled to the data network 1 1 (or any other packet-switched data 
30 network). SIP is part of the multimedia data and control architecture from the Internet 

Engineering Task Force (IETF). A version of SIP is described in RFC 2543, entitled "SIP: 
Session Initiation Protocol," dated August 1999. SIP maybe used to initiate call sessions as 
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well as to invite members to a session that may have been advertised by some other 
mechanism, such as electronic mail, news groups, web pages, and other mechanisms. The 
other protocols in the IETF multimedia and control architecture include the Resource 
Reservation Protocol (RSVP), as described in RFC 2205; the Real-Time Transport Protocol 
5 (RTP), as described in RFC 1889; the Real-Time Streaming Protocol (RTSP), as described in 
RFC 2326; the Session Description Protocol (SDP), as described in RFC 2327; and the 
Session Announcement Protocol (SAP). 

Other standards may be employed in further embodiments for controlling call sessions 
over the data network 1 1 . Such other standards may be any other standard that provides for 

10 interactive, real-time voice communications over the data network. 

The SIP client systems 38 and 40 as shown in Fig. 1 include client application . 
programs that are capable of sending SIP requests to perform call requests. The systems 38 
and 40 may also be SIP servers. A server according to SIP may be an application program 
that accepts SIP requests to service calls and to send back responses to SEP requests. Thus, a 

15 system can be either a SIP client or a SIP server. A SIP proxy system, such as system 42, 
may include an intermediary program that acts as both a server and a client for making 
requests on behalf of other clients. 

In the system 10 as shown in Fig. 1, the call control systems 32 and 36 are SEP- 
enabled; that is, the call control systems 32 and 36 are capable of sending and accepting SIP 

20 requests to establish call sessions. The call control systems 32 and 36 may be implemented 

on a standard computer system platform. Unlike the call control systems 32 and 36, however, 
the network telephones 30 and 34 are not SIP-enabled in one embodiment. Although they are 
capable of communicating audio data over the data network 1 1, the network telephones 30 
and 34 are not enabled to send or accept SIP messages (or other types of messages for 

25 establishing interactive, real-time voice communications) to establish call sessions. In 

accordance with some embodiments, the establishment, management, and termination of call 
sessions are controlled by the call control systems 32 and 36. Thus, the call control system 
32 makes SIP requests on behalf of the network telephone 30, while the call control system 
36 makes SIP requests on behalf of the network telephone 34. Once a call session is 

30 established, the network telephone 30 or 34 participates in the communication of voice data 
over the network 11. 
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By employing the arrangement as shown in Fig. 1, the superior voice capabilities of 
network telephones 30 and 34 may be utilized to provide enhanced voice quality for users 
making telephony calls over the data network 1 1 . At the same time, associated call control 
systems 32 and 36 are used to provide call signaling communications and to provide the user 
5 with a convenient user interface to perform call control as well as display information 
associated with the call session. 

The call control system 32 and the network telephone 30 may be collectively referred 
to as a telephony system 31 . Similarly, the call control system 36 and network telephone 34 
m ay be collectively referred to as a telephony system 33. To establish a call session between 
10 the telephony system 31 or 33 and another SIP-enabled remote system 100, as shown m F,g. 

3 the call control system 32 or 36 sends SIP messages to the remote system 100 to establish, a 
. call session. The remote system 100 may be any system or device on the data network 1 1 

that is capable of participating in a SIP -established call session. The call control system 32 or 
36 also exchanges commands according to a predetermined format with the network 
15 telephone 30 or 34 to let the network telephone 30 or 34 know of the current status of the call 
setup Once a call is established, a link may be established between the network telephone 30 
or 34 and the remote system 100 over the data network 1 1. The link may be a Real-Time 
Protocol (RTP) link to communicate with voice data. Thus, in the telephony system 31 or 33, 
the call control system 32 or 36 communicates the control signaling to establish a call 
20 session, while a real-time link is established directly between the network telephone 30 or 34 
and the remote system 100 for communicating voice or other types of audio data. In one _ 
embodiment, the call control messaging between the call control system and remote system, 
the control messaging between the call control system and the network telephone, and the call 
session between the network telephone and the remote system all occur over the data network 

The call control system 32 or 36 is also equipped with speech processing elements to 
allow it to communicate voice data with other devices on the data network 1 1 . Thus, a user at 
the call control system 32 or 36 may select whether to use the call control system or the 
network telephone as the terminal device in the established call session. In addition, if the 
30 call control system 32 or 36 is powered off, the network telephone 30 or 34 may be used as a 
stand-alone device to communicate voice in call sessions over the data network 11. 
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Referring to Fig. 2, the components in the network telephone 30 or 34 and in the call 
control system 32 or 36 are illustrated in greater detail. The network telephone 30 or 34 
includes a network interface 102 that is coupled to the data network 1 1 . Above the network 
interface 102 are several software layers, including a device driver layer 104, a TCP/IP or 
5 UDP/EP stack 106, and an RTP layer 108. TCP is described in RFC 793, entitled 

"Transmission Control Protocol," dated September 198 1 ; and UDP is described in RFC 768, 
entitled "User Datagram Protocol," dated August 1980. TCP and UDP are transport layers 
for managing connections between network elements over an IP network. Packets received 
by the network interface 102 are passed up through the several layers 104, 106 and 108. 

10 Control packets are transmitted by the TCP/IP or UDP/IP stack 106 to one or more control 
tasks 1 10 in the network telephone 30 or 34. The one or more control tasks 1 10 may be 
implemented as software routines executable on a control unit 112. Instructions and data 
associated with the control tasks 1 10 may be stored in a storage device 1 14. The control 
tasks 1 10 are responsible for generation of control signaling as well as exchanging commands 

15 and responses with its associated call control system 32 or 36 over the data network 11. 

Voice data may be passed through the RTP layer 108 to a speech processing 
application 1 16, which may also be executable on the control unit 1 12. For faster processing 
of voice data, a digital signal processor (DSP) 1 1 8 is included in the network telephone 30 or 
34 to provide data intensive signal processing tasks. For example, the coder/encoder 

20 (CODEC) may be implemented in the DSP 118. The network telephone may also include a 
display screen to display text data associated with a call session. The size of the display 
screen 1 20 may be limited so that only limited amounts of text data may be displayed in the 
display screen 120. The network telephone also includes numerals buttons that may be 
controlled by button control circuitry 122. The buttons may include numeric buttons, speed 

25 dial buttons, a transfer button, a hold button, a redial button, and other telephony buttons. 
Activation of any one of the buttons may cause generation of some type of an indication 
(such as an interrupt) that is forwarded to the control tasks 110. 

The call control system 32 or 36 also includes a network interface 150. Above the 
network interface 150 are several layers, including a device driver layer 152, a TCP/IP or 

30 UDP/IP stack 154, a SIP stack 156, and an RTP layer 158. The SIP stack 156 is responsible 
for processing or generating SIP requests and responses communicated over the data network 
1 1 . The SIP stack 156 is in communication with one or more control tasks 1 60 in the call 
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control system 32 or 36. The SIP stack 1 56 is generally a state machine that provides 
parsing, processing, and generation of SIP requests and responses. 

The call control tasks 1 60 are responsible for generating control signaling to establish 
call sessions over the data network 1 1 as well as to respond to received control signaling. In 
5 addition, the control tasks 160 are responsible for exchanging commands and responses with 
the network telephone 30 or 34 to establish such call sessions. The call control system 32 or 
36 may also include one or more graphical user interface (GUI) routines 162 that control the 
presentation of information (text or graphical) on a display 164 of the call control system. 
Further, the user interface provided by the GUI routines 162 may include selectors for call 
10 control and indicators of the status of a call session. 

- In the illustrated arrangement, the RTP layer 158 sends audio data to, or receives 
audio data from, a CODEC 166. TheCODEC 1 66 encodes or decodes voice data. A speech 
processing routine 168 may perform further processing of voice data. In further 
embodiments, the audio CODEC 166 and the speech processing routine 1 1 8 may be omitted. 
The various software routines in the call control system 32 or 36, including the various layers 
152 154 156, and 158 as well as the control tasks 160, CODECS 166, speech processing 
routine 168, and GUI routine 162, are executable on a control unit 170. The control unit 170 
is coupled to a storage device 172 in which instructions and data associated with the various 

software routines may be stored. 

In the illustrated example arrangement, to provide a voice or audio user interface to a 
user sitting at the call control system 32 or 36, a peripheral controller 174 is coupled to a 
microphone 176 and a speaker or head phone 178 through which a user can talk or listen 
during a call session. If the call control system 32 or 36 is not speech-enabled, the 
microphone 176 and speaker or head phone 178 may be omitted. 
25 One call control system 32 or 36 may be associated with a corresponding network 

telephone 30 or 34. Thus, the network telephone 30 or 34 can identify which device is its 
controller. Similarly, a call control system 32 or 36 can identify the network telephone it is 
controlling. The network telephone 30 or 34 includes one or more fields 120 in the storage 
device 1 1 4 to store an identifier of its call controller, in this case the call control system 32 or 
30 36. The identifier may be in the form of a network address and port number. For example, 
an IP address and a TCP or UDP port may form part of the identifier of the call controller 
120. Similarly, the call control system 32 or 36 stores one or more fields 180 in the storage 
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device 172 that stores the identifier of the network telephone it is controlling. Again, the 
identifier 1 80 may be in the form of a network address and port number, such as an IP 
address and a TCP or UDP port number. The identifier stored in the field 120 of the network 
telephone may be changed by a user to change the associated call control system. Similarly, 
5 the identifier stored in the field 1 80 of the call control system may be modified to change the 
controlled network telephone. 

In further embodiments, one call control system may be associated with plural 
network telephones. Also, a single network telephone may be associated with plural call 
control systems. 

10 The various control units in the network telephone 30 or 34, the call control 32 or 36, 

and any other system or device on the data network 1 1 may each include a microprocessor, a 
microcontroller, a processor card (including one or more microprocessors or controllers), or 
other control or computing devices. The storage devices referred to in this discussion may 
include one or more machine-readable storage media for storing data and instructions. The 

1 5 storage media may include different forms of memory including semiconductor memory 
devices such as dynamic or static random access memories (DRAMs or SRAMs), erasable 
and programmable read-only memories (EPROMs), electrically erasable and programmable 
read-only memories (EEPROMs) and flash memories; magnetic disks such as fixed, floppy 
and removable disks; other magnetic media including tape; and optical media such as 

20 compact disks (CDs) or digital video disks (DVDs). Instructions that make up the various 
, software routines, modules, or layers in the various network elements may be stored in 
respective storage devices. The instructions when executed by a respective control unit cause 
the corresponding network element to perform programmed acts. 

The instructions of the software routines, modules or layers may be loaded or 

25 transported to the network element in one of many different ways. For example, code 

segments including instructions stored on floppy disks, CD or DVD media, a hard disk, or 
transported through a network interface card, modem, or other interface device may be 
loaded into the system and executed as corresponding software routines, modules, or layers. 
In the loading or transport process, data signals that are embodied in carrier waves 

30 (transmitted over telephone lines, network lines, wireless links, cables, and the like) may 

communicate the code segments, including instructions, to the network element. Such carrier 
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waves may be in the form of e.ectrica,, optical, acoustical, elecrromagnetic, or other types of 

SigMlS Referring to Fig. 4, in accordance wirh one embodiment, a screen 200 that may be^ 
provi ded by the contro, ras.cs 160 and graphical user interface routines .62 in the call con.o, 
s ,em 32 r 36 is i.lusrrared. The screen 200 as shown in Fig. 4 includes vanous tcons ami 
7L (general* referred ,0 as indicators) to a,.ow a user sitting a, me ca„ contro, system o 
« t termin te, and screen ca.ls over ,he da,a network U . m .he example shown ,n F,g. 4 
■ : 2 To0 includes a menu 202, a series of control buttons 204, and a list 206 of porenua, 
^The lis, 206 provides Ore firs, and las, names of potential callees as we,, — 
e.eclronic mai, addresses (or other informa,ion such as .elephone numbers and so orth). M 
■ - ^Ld in Fig. 4, the name R Smith may be b ig b,igh,ed .in ,be ,is, 206. The address of R. 
Smith is displayed in an address f,e,d 208. The address field 208 may include -nous 
fola. such as a PSTN number (e.g., 972.555-.234); a PSTK number and a proxy address 
17 ^2 555-1234 @ CTEXI300); an D> address (e.g., 47, 6.,8.72); a SIP address (e.g., 
rsrmth^ortetaetworks.com); or a SIP address a, a specific TP address (e.g., 
rsmi,h@47,6 1 .,8.72). Identifiers according ,o other forma,s may be i.,us,ra,ed m ,he 
address field 208 in further embodiments. 

A aratus fie.d 2,2 may a!so be inc.uded in ,he screen 200. whtch may show ,he s,a«* 
as "no, in ca«." "outgoing ca,. ,0 R. Smith," "incoming ca.l from R. Smith," and so forth. 
, . ■ ^a, ir yo f indicators2 1 4maya,s„beprovided,n,hescree„200. A C indicator flashes 
LnanineomingcaUhasbeenmissed. An S indicator gives an indicahon 
screeningis aCive. A P indicator gives an indica.ion that a SIP proxy 'J ™°™ 
An E indicator gives an indication of fire s,a,e of ,h= associated nerworlc 
B indicator is a, a firs, state if ,he network .elephone is no, active and a, a secon s a the 
!5 network lelephone is active and available. The E indica.or may a,so be a, a fiurd share ,o 
indicate that a call is currently in progress. 

The screen 200 is a,so capab.e of providing a pop-up menu 2,0 ,0 auow a user ,0 
ae.ec, one or severs, methods of contactmg the desired ca,.ee. For example, a first optron m 
^pop-up menu 2,0 is ,0 ca„ R. Smim. Another option is to send an electron, ma„ ,0 R. 
30 Smith. A third option is to go to R. Smith's web s.te. 
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Other call control operations that may be performed by a user through the screen 200 
includes volume control, screening of incoming calls, termination of a call session, and other 
operations. 

Referring to Fig. 5, once a call is established with either a caller or a callee, another 
5 screen 300 may be shown. A picture of the caller or callee may be displayed in the screen 
300. An icon 304 may be provided to allow the user to hang-up the call, and another icon 
306 may be provided to allow the call to be placed on hold. A status field 308 indicates the 
current status of the call. 

Referring to Fig. 6, a message flow between a network telephone, a call control 

10 system, and a remote system is illustrated. According to SIP, messages that may be 

exchanged between network elements include requests and responses. The remote system 
may be another call control system, one of the SEP client systems 38 and 40, the data 
network-PSTN gateway.20, or any other system capable of establishing a call session on the 
data network 11 . The remote system first sends (at 402) an Invite request (according to SIP) 

15 to the call control system. The Invite request indicates that the receiving node is being 
invited to participate in a session. The message body of the Invite request contains a 
description (e.g., in SDP format) of the session to which the receiving node is being invited. 

The call control system may then send (at 404) a Ringing (SEP) response back to the 
remote system. The Ringing response indicates that the called user agent has located a 

20 possible location where the user has registered recently and is trying to alert the user. The 
call control system may then send (at 406) a Connection_Req message to the network 
telephone to initiate a connection between the call control system and the network telephone. 
The messaging format between the network telephone and the call control system may be any 
predetermined format that allows call establishment and control to be performed by the call 

25 control system with the network telephone. One such format is the Unified Networks IP 

Stimulus Protocol, Draft Version 2.1, dated December 7, 1999. In further embodiments, other 
interface protocols may be employed. A description of one embodiment of a protocol for 
message exchange between the network telephone and the call control system is provided in 
U.S. Patent Application Serial No. 09/307,356, entitled "Telephony and Data Network 

30 Services at a Telephone," filed on May 7, 1999, which is hereby incorporated by reference. 

The Connection_Req message is a generic message which includes one or more 
commands that indicates a request to establish a connection. The Connection_Req message 
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may actually include a ring command to activate the ringer of the network telephone and 
other commands to activate the network telephone, such as activation of the handset, headset, 
microphone, speaker, and so forth. The network telephone may then send back (at 408) an 
Ack_Req message to the call control system to acknowledge that the network telephone is 
5 available and ready. The Ack_Req message may also be a generic message to acknowledge 
receipt of the ConnectionReq message. Upon receipt of Ack_Req message from the 
network telephone, the call control system sends (at 410) a 200 OK SIP response to the 
remote system to indicate that the request has succeeded. The remote system then sends (at 
412) an Ack request (according to SIP) to the call control system. The Ack request confirms 
10 that the client has received a final response to ah Invite request. 

Upon receipt of Ack request, the call control system sends (at 414) a Remote_Answer 
message to the network telephone to indicate a request to establish a path for a call session. 
If accepted, the network telephone then sends (at 416) an AckAnswer message back to. the 
call control system. The Remote_Answer message may be a generic message that includes 
15 one or more commands to activate the network telephone for call session. One such 
command is a command to open or connect the audio stream to the handset, headset, 
microphone and speaker of the network telephone. At that point, a voice path is established 
(at 41 8) directly between the network telephone and the remote system. The voice path may 
be an RTP link over the data network 11. 
20 To terminate the call, the remote system may issue (at 420) a Bye request to the call 

control system. The call control system then responds (at 422) with a 200 OK, indicating that 
the call has been terminated. Then, the call control system sends (at 424) a Disconnect_Req 
message to the network telephone to disconnect the network telephone from the data network. 
The Disconnect_Req message be a generic message including one or more commands to 
25 deactivate various components of the network telephone. For example, the audio stream may 
be closed or disconnected, and the handset, headset, microphone, and speaker may be 
deactivated. The network telephone then returns (at 426) an Ack_Disconnect message back 
to the call control system to indicate that the call has been disconnected. 

Referring to Fig. 7, an outgoing call message flow is illustrated. In the illustrated 
30 example, the user can initiate the call from the call control system. However, the user can 
also make the external call from the network telephone by entering the desired number in 
appropriate buttons of the network telephone. In that case, messages are exchanged between 
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the network telephone and the call control system initially to indicate to the call control 
system that the user has started a phone call from the network telephone. 

To start the call session, the call control system sends (at 502) an Invite request to the 
remote system. The remote system then sends back (at 504) a Ringing response. In 
5 response, the call control system sends (at 506) a Remote_Alerting message to the network 
telephone indicating that the call has been placed. The network telephone then returns (at 
508) an Ack_Alerting message. At some point, the remote system, once it has answered the 
call, issues (at 510) a 200 OK message to the call control system. In response, the call 
control system then sends (at 5 12) an Ack request back to the remote system. The call 

10 control system also sends (at 514) a Remote_Answer message to the network telephone, 

which returns (at 516) an Ack_Answer message to the call control system. At that point, a 
voice path (e.g., an RTP path) is established (at 51 8) between the network telephone and the 
remote system over the data network 11. 

To terminate the call, the remote system may issue (at 520) a Bye request. In 

15 response, the call control system may terminate the call by sending (at 522) a 200 OK 
message. The call control system then sends (at 524) a Disconnect_Req message to the 
network telephone, which returns (at 526) an AckJDisconnect message to the call control 
system. At this point, the RTP voice path is terminated. 

While the invention has been disclosed with respect to a limited number of 

20 embodiments, those skilled in the art will appreciate numerous modifications and variations 
therefrom. It is intended that the appended claims cover all such modifications and variations 
as fall within the true spirit and scope of the invention. 
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What is claimed is: 

1 !. a method of communicating over a data network, comprising: 

2 providing a user interface in a control system for establishing call sessions; 

3 communicating, by the control system, one or more control messages over the 

4 data network to establish a call session with a remote device in response to receipt of a 

5 request through the user interface; and 

6 transmitting one or more commands to a voice device connected to the data 

7 network and associated with the control system to establish the call session between the voice 

8 device and the remote device over the data network. 

- 1 - 2 . The method of claim 1, wherein the communicated one or more control 

2 messages and the transmitted one or more commands are according to different formats. 

! 3. The method of claim 1, wherein transmitting the one or more commands to the 

2 voice device includes transmitting one or more commands to a network telephone including a 

3 network interface to the data network. 

1 4. The method of claim 1 , wherein establishing the call session includes 

2 establishing a Real-Time Protocol session over the data network. 

j 5 . The method of claim 1, wherein communicating the one or more control 

2 messages includes communicating messages according to a protocol defining real-time, 

3 interactive call sessions over a packet-switched data network. 

1 6. The method of claim 1, wherein communicating the one or more control 

2 messages includes communicating one or more Session Initiation Protocol messages. 

1 7 . The method of claim 1 , further comprising storing, in the control system, an 

2 identifier of the voice device. 

1 8. The method of claim 7, wherein storing the identifier includes storing an 

2 Internet Protocol address and a port of the voice device. 
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1 9. The method of claim 1, further comprising receiving an indication from the 

2 voice device to establish another call session with the remote device. 

1 10. The method of claim 1, further comprising displaying graphical user interface 

2 information of the call session on the control system. 

1 11. The method of claim 1, further comprising terminating the call session using 

2 either the user interface or the voice device. 

1 12. A method of communicating over a data network, comprising: 

2 in a control system, communicating one or more control messages over the 

3 data network to establish a call session with a remote device coupled to the data network; 

4 transmitting one or more commands to a voice device coupled to the data 

5 network 

6 establishing the call session between the voice device and the remote device 

7 over the data network; and 

8 displaying information associated with the call session on the control system. 

1 13. The method of claim 12, wherein displaying the information includes 

2 displaying graphical user interface information. 

1 14. The method of claim 12, wherein communicating the one or more control 

2 messages includes communicating Session Initiation Protocol messages. 

1 15. The method of claim 12, further comprising providing one or more indicators 

2 for call control in the control system. 

1 16. The method of claim 12, further comprising communicating Real-Time 

2 Protocol messages between the voice device and the remote device over the data network. 

1 17. The method of claim 12, further comprising identifying, in the control system, 

2 an address of the voice device to be controlled by the control system. 
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18. The method of claim 12, further comprising providing a user interface on a 
display of the control system, the user interface enabling selection of one or more criteria 
associated with the voice device. 

19. The method of claim 1 8, wherein the one or more criteria includes selection of 
the voice device for use in a voice session established by the control system. 

20. The method of claim 19, wherein the one or more criteria includes an 
identifier of the voice device. 

21. The method of claim 12, further comprising providing voice processing 
components in the control system and selecting one of the voice processing components and 
the voice device to communicate in the established call session. 

22. The method of claim 21, further comprising receiving user selections entered 
in a user interface of the control system to select one of the voice processing components and 
the voice device. 

23. The method of claim 21, further comprising redirecting selection to the other 
one of the voice processing components and voice device. 

24. The method of claim 12, wherein the data network includes a packet-switched 
data network. 

' 25. A system for controlling a voice device connected to a data network, 
comprising: 

a user interface including one or more selectors for call control relating to call 

sessions; 



controller adapted to receive a request from the user interface and to 
more messages foi 
session with a remote device; and 



generate one or more messages for communication over the data network to establish a call 
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8 an interface to transmit one or more commands relating to the call session to 

9 the voice device to establish a link between the voice device and the remote device over the 
10 data network. 

1 26. The system of claim 25, wherein the one or more messages include Session 

2 Initiation Protocol messages. 

1 27. The system of claim 26, further comprising a module to process the one or 

2 more Session Initiation Protocol messages. 

1 28. The system of claim 25, wherein the interface includes a network interface for 

2 coupling to the data network. 

1 29. The system of claim 25, further comprising a storage element including an 

2 identifier of the voice device. 

1 30. The system of claim 25, wherein the user interface includes one or more 

2 elements to display information relating to the call session. 

1 3 1 . The system of claim 30, wherein the information includes graphical 

2 information. 

1 32. A computer program capable of running in a system so that the system so 

2 programmed carries out a method for controlling voice communications over a data network, 

3 the method comprising: 

4 providing a user interface in the system to display information associated with 

5 a call session; 

6 communicating one or more control messages over the data network with a 

7 remote device to establish the call session between a voice device and the remote device; and 

8 controlling the voice device during the call session. 
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1 33. The computer program of claim 32, wherein communicating comprises 

2 communicating Session Initiation Protocol messages. 

1 34. The computer program of claim 32, wherein the method further comprises 

2 displaying a picture of a callee. 

j 35. The computer program of claim 32, wherein the method further comprises 

2 displaying icons selectable by a user for call control. 



1 36. A computer program product comprising at least one storage medium having 

-2 thereon computer program code means to make a system execute a method for controlling a 

3 C all session over a data network, the method comprising: 

4 providing a user interface in the system for establishing the call session; 

5 communicating one or more control messages over the data network to 

6 establish the call session with a remote device in response to a request received through. the 

7 user interface; and 

8 transmitting one or more commands to a voice device connected to the data 

9 network and associated with the control system to establish the call session between the voice 
1 0 device and the remote device over the data network. 
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(57) Abstract: Recently, network 
telephones have been developed 
that are capable of being connected 
directly to a data network, such as an 
IP network. These network telephones 
are capable of placing telephony 
calls over a data network. The voice 
quality offered by such telephones 
are typically superior to those that can 
be offered by computer systems, since 
such network telephones typically 
include dedicated digital signal 
processors (DSPs) that perform the 
data intensive calculations involved 
in speech processing! However, 
the existing network telephones 
do not provide desired multimedia 
presentation capabilities such as 
those offered by displays of computer 
systems. Thus, while networks 
telephones offer superior speech 
capabilities, it does have the desired 
multimedia capabilities. On the other hand, computer systems have superior multimedia capabilities, but they suffer from 
relatively poor speech processing performance. A need thus exists for an improved method and apparatus for controlhng voice 
communications over data networks. A method and apparatus of communicating over a data network (1 1) includes providing a 
user interface (200) in a control system (32, 36) for call control and to display information relating to a call session. The control 
system (32, 36) communicates one or more control messages (e.g., Session Initiation Protocol or SIP messages) over the data 
network (1 1) to establish a call session with a remote device in response to receipt of a request through the user interface. One or 
more commands are transmitted to a voice device (30, 34) associated with the control system (32, 36) to establish the call session 
between the voice device (30, 34) and the remote device over the data network (1 1). A Real-Time Protocol (RTP) link may be 
established between the voice device (30, 34) and the remote device. 
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